Convolutional Kitchen Sinks for Transcription Factor Binding Site Prediction
نویسندگان
چکیده
We present a simple and efficient method for prediction of transcription factor binding sites from DNA sequence. Our method computes a random approximation of a convolutional kernel feature map from DNA sequence and then learns a linear model from the approximated feature map. Our method outperforms state-ofthe-art deep learning methods on five out of six test datasets from the ENCODE consortium, while training in less than one eighth the time.
منابع مشابه
Dilated Convolutions for Modeling Long-Distance Genomic Dependencies
We consider the task of detecting regulatory elements in the human genome directly from raw DNA. Past work has focused on small snippets of DNA, making it difficult to model long-distance dependencies that arise from DNA’s 3-dimensional conformation. In order to study long-distance dependencies, we develop and release a novel dataset for a larger-context modeling task. Using this new data set w...
متن کاملDiscovery of Transcription Factor Binding Sites with Deep Convolutional Neural Networks
Transcription factors are key gene regulators, responsible for modulating the conversion of genetic information from DNA to RNA. Though these factors can be discovered experimentally, computational biologists have become increasingly interested in learning transcription factor binding sites from sequence data computationally. Though traditional machine learning architectures, including support ...
متن کاملInclusion of neighboring base interdependencies substantially improves genome-wide prokaryotic transcription factor binding site prediction
Prediction of transcription factor binding sites is an important challenge in genome analysis. The advent of next generation genome sequencing technologies makes the development of effective computational approaches particularly imperative. We have developed a novel training-based methodology intended for prokaryotic transcription factor binding site prediction. Our methodology extends existing...
متن کاملDeep Motif: Visualizing Genomic Sequence Classifications
This paper applies a deep convolutional/highway MLP framework to classify genomic sequences on the transcription factor binding site task. To make the model understandable, we propose an optimization driven strategy to extract “motifs”, or symbolic patterns which visualize the positive class learned by the network. We show that our system, Deep Motif (DeMo), extracts motifs that are similar to,...
متن کاملSmarter initializations in multi-modal neural networks to predict transcription factor binding
Various deep neural methods have been developed to predict transcription factor (TF) binding on DNA regions. In this paper we employ neural-network initialization techniques inspired by domain knowledge to augment current work in TF binding prediction. We train a Convolutional Neural Network (CNN) with the first layer of filters initialized in varying proportions to the values of position weigh...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017